Pesquisa | Portal Regional da BVS

1.

Semantic representation of neural circuit knowledge in Caenorhabditis elegans.

Prakash, Sharan J; Van Auken, Kimberly M; Hill, David P; Sternberg, Paul W.

Brain Inform ; 10(1): 30, 2023 Nov 10.

Artigo em Inglês | MEDLINE | ID: mdl-37947958

RESUMO

In modern biology, new knowledge is generated quickly, making it challenging for researchers to efficiently acquire and synthesise new information from the large volume of primary publications. To address this problem, computational approaches that generate machine-readable representations of scientific findings in the form of knowledge graphs have been developed. These representations can integrate different types of experimental data from multiple papers and biological knowledge bases in a unifying data model, providing a complementary method to manual review for interacting with published knowledge. The Gene Ontology Consortium (GOC) has created a semantic modelling framework that extends individual functional gene annotations to structured descriptions of causal networks representing biological processes (Gene Ontology-Causal Activity Modelling, or GO-CAM). In this study, we explored whether the GO-CAM framework could represent knowledge of the causal relationships between environmental inputs, neural circuits and behavior in the model nematode C. elegans [C. elegans Neural-Circuit Causal Activity Modelling (CeN-CAM)]. We found that, given extensions to several relevant ontologies, a wide variety of author statements from the literature about the neural circuit basis of egg-laying and carbon dioxide (CO2) avoidance behaviors could be faithfully represented with CeN-CAM. Through this process, we were able to generate generic data models for several categories of experimental results. We also discuss how semantic modelling may be used to functionally annotate the C. elegans connectome. Thus, Gene Ontology-based semantic modelling has the potential to support various machine-readable representations of neurobiological knowledge.

2.

Biochemical pathways represented by Gene Ontology-Causal Activity Models identify distinct phenotypes resulting from mutations in pathways.

Hill, David P; Drabkin, Harold J; Smith, Cynthia L; Van Auken, Kimberly M; D'Eustachio, Peter.

Genetics ; 225(2)2023 Oct 04.

Artigo em Inglês | MEDLINE | ID: mdl-37579192

RESUMO

Gene inactivation can affect the process(es) in which that gene acts and causally downstream ones, yielding diverse mutant phenotypes. Identifying the genetic pathways resulting in a given phenotype helps us understand how individual genes interact in a functional network. Computable representations of biological pathways include detailed process descriptions in the Reactome Knowledgebase and causal activity flows between molecular functions in Gene Ontology-Causal Activity Models (GO-CAMs). A computational process has been developed to convert Reactome pathways to GO-CAMs. Laboratory mice are widely used models of normal and pathological human processes. We have converted human Reactome GO-CAMs to orthologous mouse GO-CAMs, as a resource to transfer pathway knowledge between humans and model organisms. These mouse GO-CAMs allowed us to define sets of genes that function in a causally connected way. To demonstrate that individual variant genes from connected pathways result in similar but distinguishable phenotypes, we used the genes in our pathway models to cross-query mouse phenotype annotations in the Mouse Genome Database (MGD). Using GO-CAM representations of 2 related but distinct pathways, gluconeogenesis and glycolysis, we show that individual causal paths in gene networks give rise to discrete phenotypic outcomes resulting from perturbations of glycolytic and gluconeogenic genes. The accurate and detailed descriptions of gene interactions recovered in this analysis of well-studied processes suggest that this strategy can be applied to less well-understood processes in less well-studied model systems to predict phenotypic outcomes of novel gene variants and to identify potential gene targets in altered processes.

Assuntos

Biologia Computacional , Bases de Dados Genéticas , Camundongos , Humanos , Animais , Ontologia Genética , Mutação , Fenótipo , Biologia Computacional/métodos

3.

Biochemical Pathways Represented by Gene Ontology Causal Activity Models Identify Distinct Phenotypes Resulting from Mutations in Pathways.

Hill, David P; Drabkin, Harold J; Smith, Cynthia L; Van Auken, Kimberly M; D'Eustachio, Peter.

bioRxiv ; 2023 Jul 13.

Artigo em Inglês | MEDLINE | ID: mdl-37293039

RESUMO

Gene inactivation can affect the process(es) in which that gene acts and causally downstream ones, yielding diverse mutant phenotypes. Identifying the genetic pathways resulting in a given phenotype helps us understand how individual genes interact in a functional network. Computable representations of biological pathways include detailed process descriptions in the Reactome Knowledgebase, and causal activity flows between molecular functions in Gene Ontology-Causal Activity Models (GO-CAMs). A computational process has been developed to convert Reactome pathways to GO-CAMs. Laboratory mice are widely used models of normal and pathological human processes. We have converted human Reactome GO-CAMs to orthologous mouse GO-CAMs, as a resource to transfer pathway knowledge between humans and model organisms. These mouse GO-CAMs allowed us to define sets of genes that function in a connected and well-defined way. To test whether individual genes from well-defined pathways result in similar and distinguishable phenotypes, we used the genes in our pathway models to cross-query mouse phenotype annotations in the Mouse Genome Database (MGD). Using GO-CAM representations of two related but distinct pathways, gluconeogenesis and glycolysis, we can identify causal paths in gene networks that give rise to discrete phenotypic outcomes for perturbations of glycolysis and gluconeogenesis. The accurate and detailed descriptions of gene interactions recovered in this analysis of well-studied processes suggest that this strategy can be applied to less well-understood processes in less well-studied model systems to predict phenotypic outcomes of novel gene variants and to identify potential gene targets in altered processes.

4.

Semantic Representation of Neural Circuit Knowledge in Caenorhabditis elegans.

Prakash, Sharan J; Van Auken, Kimberly M; Hill, David P; Sternberg, Paul W.

bioRxiv ; 2023 Sep 26.

Artigo em Inglês | MEDLINE | ID: mdl-37162850

RESUMO

In modern biology, new knowledge is generated quickly, making it challenging for researchers to efficiently acquire and synthesise new information from the large volume of primary publications. To address this problem, computational approaches that generate machine-readable representations of scientific findings in the form of knowledge graphs have been developed. These representations can integrate different types of experimental data from multiple papers and biological knowledge bases in a unifying data model, providing a complementary method to manual review for interacting with published knowledge. The Gene Ontology Consortium (GOC) has created a semantic modelling framework that extends individual functional gene annotations to structured descriptions of causal networks representing biological processes (Gene Ontology Causal Activity Modelling, or GO-CAM). In this study, we explored whether the GO-CAM framework could represent knowledge of the causal relationships between environmental inputs, neural circuits and behavior in the model nematode C. elegans (C. elegans Neural Circuit Causal Activity Modelling (CeN-CAM)). We found that, given extensions to several relevant ontologies, a wide variety of author statements from the literature about the neural circuit basis of egg-laying and carbon dioxide (CO2) avoidance behaviors could be faithfully represented with CeN-CAM. Through this process, we were able to generate generic data models for several categories of experimental results. We also discuss how semantic modelling may be used to functionally annotate the C. elegans connectome. Thus, Gene Ontology-based semantic modelling has the potential to support various machine-readable representations of neurobiological knowledge.

5.

The Gene Ontology knowledgebase in 2023.

Aleksander, Suzi A; Balhoff, James; Carbon, Seth; Cherry, J Michael; Drabkin, Harold J; Ebert, Dustin; Feuermann, Marc; Gaudet, Pascale; Harris, Nomi L; Hill, David P; Lee, Raymond; Mi, Huaiyu; Moxon, Sierra; Mungall, Christopher J; Muruganugan, Anushya; Mushayahama, Tremayne; Sternberg, Paul W; Thomas, Paul D; Van Auken, Kimberly; Ramsey, Jolene; Siegele, Deborah A; Chisholm, Rex L; Fey, Petra; Aspromonte, Maria Cristina; Nugnes, Maria Victoria; Quaglia, Federica; Tosatto, Silvio; Giglio, Michelle; Nadendla, Suvarna; Antonazzo, Giulia; Attrill, Helen; Dos Santos, Gil; Marygold, Steven; Strelets, Victor; Tabone, Christopher J; Thurmond, Jim; Zhou, Pinglei; Ahmed, Saadullah H; Asanitthong, Praoparn; Luna Buitrago, Diana; Erdol, Meltem N; Gage, Matthew C; Ali Kadhum, Mohamed; Li, Kan Yan Chloe; Long, Miao; Michalak, Aleksandra; Pesala, Angeline; Pritazahra, Armalya; Saverimuttu, Shirin C C; Su, Renzhi.

Genetics ; 224(1)2023 05 04.

Artigo em Inglês | MEDLINE | ID: mdl-36866529

RESUMO

The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project.

Assuntos

Bases de Dados Genéticas , Proteínas , Ontologia Genética , Proteínas/genética , Anotação de Sequência Molecular , Biologia Computacional

6.

Reactome and the Gene Ontology: digital convergence of data resources.

Good, Benjamin M; Van Auken, Kimberly; Hill, David P; Mi, Huaiyu; Carbon, Seth; Balhoff, James P; Albou, Laurent-Philippe; Thomas, Paul D; Mungall, Christopher J; Blake, Judith A; D'Eustachio, Peter.

Bioinformatics ; 37(19): 3343-3348, 2021 Oct 11.

Artigo em Inglês | MEDLINE | ID: mdl-33964129

RESUMO

MOTIVATION: Gene Ontology Causal Activity Models (GO-CAMs) assemble individual associations of gene products with cellular components, molecular functions and biological processes into causally linked activity flow models. Pathway databases such as the Reactome Knowledgebase create detailed molecular process descriptions of reactions and assemble them, based on sharing of entities between individual reactions into pathway descriptions. RESULTS: To convert the rich content of Reactome into GO-CAMs, we have developed a software tool, Pathways2GO, to convert the entire set of normal human Reactome pathways into GO-CAMs. This conversion yields standard GO annotations from Reactome content and supports enhanced quality control for both Reactome and GO, yielding a nearly seamless conversion between these two resources for the bioinformatics community. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

7.

Investigation of COVID-19 comorbidities reveals genes and pathways coincident with the SARS-CoV-2 viral disease.

Dolan, Mary E; Hill, David P; Mukherjee, Gaurab; McAndrews, Monica S; Chesler, Elissa J; Blake, Judith A.

Sci Rep ; 10(1): 20848, 2020 11 30.

Artigo em Inglês | MEDLINE | ID: mdl-33257774

RESUMO

The emergence of the SARS-CoV-2 virus and subsequent COVID-19 pandemic initiated intense research into the mechanisms of action for this virus. It was quickly noted that COVID-19 presents more seriously in conjunction with other human disease conditions such as hypertension, diabetes, and lung diseases. We conducted a bioinformatics analysis of COVID-19 comorbidity-associated gene sets, identifying genes and pathways shared among the comorbidities, and evaluated current knowledge about these genes and pathways as related to current information about SARS-CoV-2 infection. We performed our analysis using GeneWeaver (GW), Reactome, and several biomedical ontologies to represent and compare common COVID-19 comorbidities. Phenotypic analysis of shared genes revealed significant enrichment for immune system phenotypes and for cardiovascular-related phenotypes, which might point to alleles and phenotypes in mouse models that could be evaluated for clues to COVID-19 severity. Through pathway analysis, we identified enriched pathways shared by comorbidity datasets and datasets associated with SARS-CoV-2 infection.

Assuntos

COVID-19/mortalidade , COVID-19/patologia , Biologia Computacional/métodos , Animais , Doenças Cardiovasculares/epidemiologia , Doenças Cardiovasculares/genética , Comorbidade , Síndrome da Liberação de Citocina/mortalidade , Bases de Dados Genéticas , Diabetes Mellitus/epidemiologia , Diabetes Mellitus/genética , Modelos Animais de Doenças , Hepatite/epidemiologia , Hepatite/genética , Humanos , Nefropatias/epidemiologia , Nefropatias/genética , Pneumopatias/epidemiologia , Pneumopatias/genética , Camundongos , Síndrome do Desconforto Respiratório/mortalidade , SARS-CoV-2 , Índice de Gravidade de Doença

8.

Investigation of COVID-19 comorbidities reveals genes and pathways coincident with the SARS-CoV-2 viral disease.

Dolan, Mary E; Hill, David P; Mukherjee, Gaurab; McAndrews, Monica S; Chesler, Elissa J; Blake, Judith A.

bioRxiv ; 2020 Sep 21.

Artigo em Inglês | MEDLINE | ID: mdl-32995795

RESUMO

The emergence of the SARS-CoV-2 virus and subsequent COVID-19 pandemic initiated intense research into the mechanisms of action for this virus. It was quickly noted that COVID-19 presents more seriously in conjunction with other human disease conditions such as hypertension, diabetes, and lung diseases. We conducted a bioinformatics analysis of COVID-19 comorbidity-associated gene sets, identifying genes and pathways shared among the comorbidities, and evaluated current knowledge about these genes and pathways as related to current information about SARS-CoV-2 infection. We performed our analysis using GeneWeaver (GW), Reactome, and several biomedical ontologies to represent and compare common COVID-19 comorbidities. Phenotypic analysis of shared genes revealed significant enrichment for immune system phenotypes and for cardiovascular-related phenotypes, which might point to alleles and phenotypes in mouse models that could be evaluated for clues to COVID-19 severity. Through pathway analysis, we identified enriched pathways shared by comorbidity datasets and datasets associated with SARS-CoV-2 infection.

9.

Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns.

Wood, Valerie; Carbon, Seth; Harris, Midori A; Lock, Antonia; Engel, Stacia R; Hill, David P; Van Auken, Kimberly; Attrill, Helen; Feuermann, Marc; Gaudet, Pascale; Lovering, Ruth C; Poux, Sylvain; Rutherford, Kim M; Mungall, Christopher J.

Open Biol ; 10(9): 200149, 2020 09.

Artigo em Inglês | MEDLINE | ID: mdl-32875947

RESUMO

Biological processes are accomplished by the coordinated action of gene products. Gene products often participate in multiple processes, and can therefore be annotated to multiple Gene Ontology (GO) terms. Nevertheless, processes that are functionally, temporally and/or spatially distant may have few gene products in common, and co-annotation to unrelated processes probably reflects errors in literature curation, ontology structure or automated annotation pipelines. We have developed an annotation quality control workflow that uses rules based on mutually exclusive processes to detect annotation errors, based on and validated by case studies including the three we present here: fission yeast protein-coding gene annotations over time; annotations for cohesin complex subunits in human and model species; and annotations using a selected set of GO biological process terms in human and five model species. For each case study, we reviewed available GO annotations, identified pairs of biological processes which are unlikely to be correctly co-annotated to the same gene products (e.g. amino acid metabolism and cytokinesis), and traced erroneous annotations to their sources. To date we have generated 107 quality control rules, and corrected 289 manual annotations in eukaryotes and over 52 700 automatically propagated annotations across all taxa.

Assuntos

Biologia Computacional/métodos , Ontologia Genética , Anotação de Sequência Molecular , Bases de Dados Genéticas , Evolução Molecular , Genoma Fúngico , Genômica/métodos , Controle de Qualidade , Schizosaccharomyces/genética , Navegador , Fluxo de Trabalho

10.

Cisplatin-resistant triple-negative breast cancer subtypes: multiple mechanisms of resistance.

Hill, David P; Harper, Akeena; Malcolm, Joan; McAndrews, Monica S; Mockus, Susan M; Patterson, Sara E; Reynolds, Timothy; Baker, Erich J; Bult, Carol J; Chesler, Elissa J; Blake, Judith A.

BMC Cancer ; 19(1): 1039, 2019 Nov 04.

Artigo em Inglês | MEDLINE | ID: mdl-31684899

RESUMO

BACKGROUND: Understanding mechanisms underlying specific chemotherapeutic responses in subtypes of cancer may improve identification of treatment strategies most likely to benefit particular patients. For example, triple-negative breast cancer (TNBC) patients have variable response to the chemotherapeutic agent cisplatin. Understanding the basis of treatment response in cancer subtypes will lead to more informed decisions about selection of treatment strategies. METHODS: In this study we used an integrative functional genomics approach to investigate the molecular mechanisms underlying known cisplatin-response differences among subtypes of TNBC. To identify changes in gene expression that could explain mechanisms of resistance, we examined 102 evolutionarily conserved cisplatin-associated genes, evaluating their differential expression in the cisplatin-sensitive, basal-like 1 (BL1) and basal-like 2 (BL2) subtypes, and the two cisplatin-resistant, luminal androgen receptor (LAR) and mesenchymal (M) subtypes of TNBC. RESULTS: We found 20 genes that were differentially expressed in at least one subtype. Fifteen of the 20 genes are associated with cell death and are distributed among all TNBC subtypes. The less cisplatin-responsive LAR and M TNBC subtypes show different regulation of 13 genes compared to the more sensitive BL1 and BL2 subtypes. These 13 genes identify a variety of cisplatin-resistance mechanisms including increased transport and detoxification of cisplatin, and mis-regulation of the epithelial to mesenchymal transition. CONCLUSIONS: We identified gene signatures in resistant TNBC subtypes indicative of mechanisms of cisplatin. Our results indicate that response to cisplatin in TNBC has a complex foundation based on impact of treatment on distinct cellular pathways. We find that examination of expression data in the context of heterogeneous data such as drug-gene interactions leads to a better understanding of mechanisms at work in cancer therapy response.

Assuntos

Antineoplásicos/uso terapêutico , Cisplatino/uso terapêutico , Resistencia a Medicamentos Antineoplásicos/genética , Genômica/métodos , Neoplasias de Mama Triplo Negativas/tratamento farmacológico , Animais , Evolução Biológica , Linhagem Celular Tumoral , Sequência Conservada , Transição Epitelial-Mesenquimal/genética , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Camundongos , Ratos , Receptores Androgênicos/metabolismo

11.

Gene Ontology Causal Activity Modeling (GO-CAM) moves beyond GO annotations to structured descriptions of biological functions and systems.

Thomas, Paul D; Hill, David P; Mi, Huaiyu; Osumi-Sutherland, David; Van Auken, Kimberly; Carbon, Seth; Balhoff, James P; Albou, Laurent-Philippe; Good, Benjamin; Gaudet, Pascale; Lewis, Suzanna E; Mungall, Christopher J.

Nat Genet ; 51(10): 1429-1433, 2019 10.

Artigo em Inglês | MEDLINE | ID: mdl-31548717

Assuntos

Biologia Computacional/métodos , Ontologia Genética , Modelos Biológicos , Anotação de Sequência Molecular , Transdução de Sinais , Bases de Dados Genéticas , Humanos , Fenótipo

12.

Deep fluid pathways beneath Mammoth Mountain, California, illuminated by migrating earthquake swarms.

Hotovec-Ellis, Alicia J; Shelly, David R; Hill, David P; Pitt, Andrew M; Dawson, Philip B; Chouet, Bernard A.

Sci Adv ; 4(8): eaat5258, 2018 08.

Artigo em Inglês | MEDLINE | ID: mdl-30116785

RESUMO

Although most volcanic seismicity is shallow (within several kilometers of the surface), some volcanoes exhibit deeper seismicity (10 to 30+ km) that may reflect active processes such as magma resupply and volatile transfer. One such volcano is Mammoth Mountain, California, which has also recently exhibited high rates of CO2 discharge at the surface. We perform high-resolution earthquake detection and relocation to reveal punctuated episodes of rapidly propagating seismicity at mid-crustal depths along a narrow fracture zone surrounding a body of partial melt. We infer that these earthquakes track dike intrusions or fluid pressure pulses associated with CO2 exsolution, suggesting that the deep plumbing system of Mammoth Mountain is an active conduit for fluid transport from the base of the crust to the surface.

13.

Exploring autophagy with Gene Ontology.

Denny, Paul; Feuermann, Marc; Hill, David P; Lovering, Ruth C; Plun-Favreau, Helene; Roncaglia, Paola.

Autophagy ; 14(3): 419-436, 2018.

Artigo em Inglês | MEDLINE | ID: mdl-29455577

RESUMO

Autophagy is a fundamental cellular process that is well conserved among eukaryotes. It is one of the strategies that cells use to catabolize substances in a controlled way. Autophagy is used for recycling cellular components, responding to cellular stresses and ridding cells of foreign material. Perturbations in autophagy have been implicated in a number of pathological conditions such as neurodegeneration, cardiac disease and cancer. The growing knowledge about autophagic mechanisms needs to be collected in a computable and shareable format to allow its use in data representation and interpretation. The Gene Ontology (GO) is a freely available resource that describes how and where gene products function in biological systems. It consists of 3 interrelated structured vocabularies that outline what gene products do at the biochemical level, where they act in a cell and the overall biological objectives to which their actions contribute. It also consists of 'annotations' that associate gene products with the terms. Here we describe how we represent autophagy in GO, how we create and define terms relevant to autophagy researchers and how we interrelate those terms to generate a coherent view of the process, therefore allowing an interoperable description of its biological aspects. We also describe how annotation of gene products with GO terms improves data analysis and interpretation, hence bringing a significant benefit to this field of study.

Assuntos

Autofagia/genética , Bases de Dados Genéticas , Ontologia Genética , Doença de Parkinson/genética , Animais , Humanos , Anotação de Sequência Molecular , Proteínas/metabolismo

14.

Improving Interpretation of Cardiac Phenotypes and Enhancing Discovery With Expanded Knowledge in the Gene Ontology.

Lovering, Ruth C; Roncaglia, Paola; Howe, Douglas G; Laulederkind, Stanley J F; Khodiyar, Varsha K; Berardini, Tanya Z; Tweedie, Susan; Foulger, Rebecca E; Osumi-Sutherland, David; Campbell, Nancy H; Huntley, Rachael P; Talmud, Philippa J; Blake, Judith A; Breckenridge, Ross; Riley, Paul R; Lambiase, Pier D; Elliott, Perry M; Clapp, Lucie; Tinker, Andrew; Hill, David P.

Circ Genom Precis Med ; 11(2): e001813, 2018 02.

Artigo em Inglês | MEDLINE | ID: mdl-29440116

RESUMO

BACKGROUND: A systems biology approach to cardiac physiology requires a comprehensive representation of how coordinated processes operate in the heart, as well as the ability to interpret relevant transcriptomic and proteomic experiments. The Gene Ontology (GO) Consortium provides structured, controlled vocabularies of biological terms that can be used to summarize and analyze functional knowledge for gene products. METHODS AND RESULTS: In this study, we created a computational resource to facilitate genetic studies of cardiac physiology by integrating literature curation with attention to an improved and expanded ontological representation of heart processes in the Gene Ontology. As a result, the Gene Ontology now contains terms that comprehensively describe the roles of proteins in cardiac muscle cell action potential, electrical coupling, and the transmission of the electrical impulse from the sinoatrial node to the ventricles. Evaluating the effectiveness of this approach to inform data analysis demonstrated that Gene Ontology annotations, analyzed within an expanded ontological context of heart processes, can help to identify candidate genes associated with arrhythmic disease risk loci. CONCLUSIONS: We determined that a combination of curation and ontology development for heart-specific genes and processes supports the identification and downstream analysis of genes responsible for the spread of the cardiac action potential through the heart. Annotating these genes and processes in a structured format facilitates data analysis and supports effective retrieval of gene-centric information about cardiac defects.

Assuntos

Ontologia Genética , Cardiopatias , Proteômica , Biologia Computacional , Bases de Dados Genéticas , Coração , Cardiopatias/genética , Humanos , Anotação de Sequência Molecular , Fenótipo

15.

Modeling biochemical pathways in the gene ontology.

Hill, David P; D'Eustachio, Peter; Berardini, Tanya Z; Mungall, Christopher J; Renedo, Nikolai; Blake, Judith A.

Database (Oxford) ; 20162016.

Artigo em Inglês | MEDLINE | ID: mdl-27589964

RESUMO

The concept of a biological pathway, an ordered sequence of molecular transformations, is used to collect and represent molecular knowledge for a broad span of organismal biology. Representations of biomedical pathways typically are rich but idiosyncratic presentations of organized knowledge about individual pathways. Meanwhile, biomedical ontologies and associated annotation files are powerful tools that organize molecular information in a logically rigorous form to support computational analysis. The Gene Ontology (GO), representing Molecular Functions, Biological Processes and Cellular Components, incorporates many aspects of biological pathways within its ontological representations. Here we present a methodology for extending and refining the classes in the GO for more comprehensive, consistent and integrated representation of pathways, leveraging knowledge embedded in current pathway representations such as those in the Reactome Knowledgebase and MetaCyc. With carbohydrate metabolic pathways as a use case, we discuss how our representation supports the integration of variant pathway classes into a unified ontological structure that can be used for data comparison and analysis.

Assuntos

Metabolismo dos Carboidratos/fisiologia , Bases de Dados Genéticas , Ontologia Genética , Modelos Biológicos , Animais , Humanos

16.

Application of comparative biology in GO functional annotation: the mouse model.

Drabkin, Harold J; Christie, Karen R; Dolan, Mary E; Hill, David P; Ni, Li; Sitnikov, Dmitry; Blake, Judith A.

Mamm Genome ; 26(9-10): 574-83, 2015 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-26141960

RESUMO

The Gene Ontology (GO) is an important component of modern biological knowledge representation with great utility for computational analysis of genomic and genetic data. The Gene Ontology Consortium (GOC) consists of a large team of contributors including curation teams from most model organism database groups as well as curation teams focused on representation of data relevant to specific human diseases. Key to the generation of consistent and comprehensive annotations is the development and use of shared standards and measures of curation quality. The GOC engages all contributors to work to a defined standard of curation that is presented here in the context of annotation of genes in the laboratory mouse. Comprehensive understanding of the origin, epistemology, and coverage of GO annotations is essential for most effective use of GO resources. Here the application of comparative approaches to capturing functional data in the mouse system is described.

Assuntos

Bases de Dados Genéticas , Ontologia Genética , Anotação de Sequência Molecular , Animais , Biologia Computacional , Genômica , Humanos , Camundongos , Análise de Sequência de DNA

17.

Methodology for the inference of gene function from phenotype data.

Ascensao, Joao A; Dolan, Mary E; Hill, David P; Blake, Judith A.

BMC Bioinformatics ; 15: 405, 2014 Dec 12.

Artigo em Inglês | MEDLINE | ID: mdl-25495798

RESUMO

BACKGROUND: Biomedical ontologies are increasingly instrumental in the advancement of biological research primarily through their use to efficiently consolidate large amounts of data into structured, accessible sets. However, ontology development and usage can be hampered by the segregation of knowledge by domain that occurs due to independent development and use of the ontologies. The ability to infer data associated with one ontology to data associated with another ontology would prove useful in expanding information content and scope. We here focus on relating two ontologies: the Gene Ontology (GO), which encodes canonical gene function, and the Mammalian Phenotype Ontology (MP), which describes non-canonical phenotypes, using statistical methods to suggest GO functional annotations from existing MP phenotype annotations. This work is in contrast to previous studies that have focused on inferring gene function from phenotype primarily through lexical or semantic similarity measures. RESULTS: We have designed and tested a set of algorithms that represents a novel methodology to define rules for predicting gene function by examining the emergent structure and relationships between the gene functions and phenotypes rather than inspecting the terms semantically. The algorithms inspect relationships among multiple phenotype terms to deduce if there are cases where they all arise from a single gene function. We apply this methodology to data about genes in the laboratory mouse that are formally represented in the Mouse Genome Informatics (MGI) resource. From the data, 7444 rule instances were generated from five generalized rules, resulting in 4818 unique GO functional predictions for 1796 genes. CONCLUSIONS: We show that our method is capable of inferring high-quality functional annotations from curated phenotype data. As well as creating inferred annotations, our method has the potential to allow for the elucidation of unforeseen, biologically significant associations between gene function and phenotypes that would be overlooked by a semantics-based approach. Future work will include the implementation of the described algorithms for a variety of other model organism databases, taking full advantage of the abundance of available high quality curated data.

Assuntos

Algoritmos , Redes Reguladoras de Genes , Anotação de Sequência Molecular , Fenótipo , Animais , Bases de Dados Factuais , Camundongos

18.

Representing kidney development using the gene ontology.

Alam-Faruque, Yasmin; Hill, David P; Dimmer, Emily C; Harris, Midori A; Foulger, Rebecca E; Tweedie, Susan; Attrill, Helen; Howe, Douglas G; Thomas, Stephen Randall; Davidson, Duncan; Woolf, Adrian S; Blake, Judith A; Mungall, Christopher J; O'Donovan, Claire; Apweiler, Rolf; Huntley, Rachael P.

PLoS One ; 9(6): e99864, 2014.

Artigo em Inglês | MEDLINE | ID: mdl-24941002

RESUMO

Gene Ontology (GO) provides dynamic controlled vocabularies to aid in the description of the functional biological attributes and subcellular locations of gene products from all taxonomic groups (www.geneontology.org). Here we describe collaboration between the renal biomedical research community and the GO Consortium to improve the quality and quantity of GO terms describing renal development. In the associated annotation activity, the new and revised terms were associated with gene products involved in renal development and function. This project resulted in a total of 522 GO terms being added to the ontology and the creation of approximately 9,600 kidney-related GO term associations to 940 UniProt Knowledgebase (UniProtKB) entries, covering 66 taxonomic groups. We demonstrate the impact of these improvements on the interpretation of GO term analyses performed on genes differentially expressed in kidney glomeruli affected by diabetic nephropathy. In summary, we have produced a resource that can be utilized in the interpretation of data from small- and large-scale experiments investigating molecular mechanisms of kidney function and development and thereby help towards alleviating renal disease.

Assuntos

Ontologia Genética , Rim/embriologia , Rim/metabolismo , Animais , Bases de Dados Genéticas , Bases de Dados de Proteínas , Humanos , Camundongos , Anotação de Sequência Molecular , Especificidade da Espécie , Estatística como Assunto

19.

A method for increasing expressivity of Gene Ontology annotations using a compositional approach.

Huntley, Rachael P; Harris, Midori A; Alam-Faruque, Yasmin; Blake, Judith A; Carbon, Seth; Dietze, Heiko; Dimmer, Emily C; Foulger, Rebecca E; Hill, David P; Khodiyar, Varsha K; Lock, Antonia; Lomax, Jane; Lovering, Ruth C; Mutowo-Meullenet, Prudence; Sawford, Tony; Van Auken, Kimberly; Wood, Valerie; Mungall, Christopher J.

BMC Bioinformatics ; 15: 155, 2014 May 21.

Artigo em Inglês | MEDLINE | ID: mdl-24885854

RESUMO

BACKGROUND: The Gene Ontology project integrates data about the function of gene products across a diverse range of organisms, allowing the transfer of knowledge from model organisms to humans, and enabling computational analyses for interpretation of high-throughput experimental and clinical data. The core data structure is the annotation, an association between a gene product and a term from one of the three ontologies comprising the GO. Historically, it has not been possible to provide additional information about the context of a GO term, such as the target gene or the location of a molecular function. This has limited the specificity of knowledge that can be expressed by GO annotations. RESULTS: The GO Consortium has introduced annotation extensions that enable manually curated GO annotations to capture additional contextual details. Extensions represent effector-target relationships such as localization dependencies, substrates of protein modifiers and regulation targets of signaling pathways and transcription factors as well as spatial and temporal aspects of processes such as cell or tissue type or developmental stage. We describe the content and structure of annotation extensions, provide examples, and summarize the current usage of annotation extensions. CONCLUSIONS: The additional contextual information captured by annotation extensions improves the utility of functional annotation by representing dependencies between annotations to terms in the different ontologies of GO, external ontologies, or an organism's gene products. These enhanced annotations can also support sophisticated queries and reasoning, and will provide curated, directional links between many gene products to support pathway and network reconstruction.

Assuntos

Ontologia Genética , Anotação de Sequência Molecular , Biologia Computacional/métodos , Humanos , Proteínas/genética

20.

TermGenie - a web-application for pattern-based ontology class generation.

Dietze, Heiko; Berardini, Tanya Z; Foulger, Rebecca E; Hill, David P; Lomax, Jane; Osumi-Sutherland, David; Roncaglia, Paola; Mungall, Christopher J.

J Biomed Semantics ; 5: 48, 2014.

Artigo em Inglês | MEDLINE | ID: mdl-25937883

RESUMO

BACKGROUND: Biological ontologies are continually growing and improving from requests for new classes (terms) by biocurators. These ontology requests can frequently create bottlenecks in the biocuration process, as ontology developers struggle to keep up, while manually processing these requests and create classes. RESULTS: TermGenie allows biocurators to generate new classes based on formally specified design patterns or templates. The system is web-based and can be accessed by any authorized curator through a web browser. Automated rules and reasoning engines are used to ensure validity, uniqueness and relationship to pre-existing classes. In the last 4 years the Gene Ontology TermGenie generated 4715 new classes, about 51.4% of all new classes created. The immediate generation of permanent identifiers proved not to be an issue with only 70 (1.4%) obsoleted classes. CONCLUSION: TermGenie is a web-based class-generation system that complements traditional ontology development tools. All classes added through pre-defined templates are guaranteed to have OWL equivalence axioms that are used for automatic classification and in some cases inter-ontology linkage. At the same time, the system is simple and intuitive and can be used by most biocurators without extensive training.

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA